ABSTRACT
INTRODUCTION: The size and complexity of virtual screening libraries in drug discovery have skyrocketed in recent years, reaching up to multiple billions of accessible compounds. However, virtual screening of such ultra-large libraries poses several challenges associated with preparing the libraries, sampling, and pre-selection of suitable compounds. The utilization of artificial intelligence (AI)-assisted screening approaches, such as deep learning, poses a promising countermeasure to deal with this rapidly expanding chemical space. For example, various AI-driven methods were recently successfully used to identify novel small molecule inhibitors of the SARS-CoV-2 main protease (Mpro). AREAS COVERED: This review focuses on presenting various kinds of virtual screening methods suitable for dealing with ultra-large libraries. Challenges associated with these computational methodologies are discussed, and recent advances are highlighted in the example of the discovery of novel Mpro inhibitors targeting the SARS-CoV-2 virus. EXPERT OPINION: With the rapid expansion of the virtual chemical space, the methodologies for docking and screening such quantities of molecules need to keep pace. Employment of AI-driven screening compounds has already been shown to be effective in a range from a few thousand to multiple billion compounds, furthered by de novo generation of drug-like molecules without human interference.
Subject(s)
COVID-19 , SARS-CoV-2 , Humans , Artificial Intelligence , Molecular Docking Simulation , Drug Discovery/methods , Protease Inhibitors/pharmacology , Antiviral Agents/pharmacology , Antiviral Agents/chemistryABSTRACT
The SARS-CoV2 pandemic has highlighted the importance of efficient and effective methods for identification of therapeutic drugs, and in particular has laid bare the need for methods that allow exploration of the full diversity of synthesizable small molecules. While classical high-throughput screening methods may consider up to millions of molecules, virtual screening methods hold the promise of enabling appraisal of billions of candidate molecules, thus expanding the search space while concurrently reducing costs and speeding discovery. Here, we describe a new screening pipeline, called drugsniffer, that is capable of rapidly exploring drug candidates from a library of billions of molecules, and is designed to support distributed computation on cluster and cloud resources. As an example of performance, our pipeline required â¼40,000 total compute hours to screen for potential drugs targeting three SARS-CoV2 proteins among a library of â¼3.7 billion candidate molecules.